MAC design for WiFi infrastructure networks: 
a game-theoretic approach 
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Abstract — In WiFi networks, mobile nodes compete 
for accessing a shared channel by means of a random 
access protocol called Distributed Coordination Function 
(DCF). Although this protocol is in principle fair, since 
all the stations have the same probability to transmit 
on the channel, it has been shown that unfair behaviors 
may emerge in actual networking scenarios because of 
non-standard configurations of the nodes. Due to the 
proliferation of open source drivers and programmable 
cards, enabling an easy customization of the channel access 
policies, we propose a game-theoretic analysis of random 
access schemes. Assuming that each node is rational and 
implements a best response strategy, we show that efficient 
equilibria conditions can be reached when stations are 
interested in both uploading and downloading traffic. 
More interesting, these equilibria are reached when all the 
stations play the same strategy, thus guaranteeing a fair 
resource sharing. When stations are interested in upload 
traffic only, we also propose a mechanism design, based 
on an artificial dropping of layer-2 acknowledgments, to 
force desired equilibria. Finally, we propose and evaluate 
some simple DCF extensions for practically implementing 
our theoretical findings. 



I. Introduction 

The problem of resource sharing in WiFi net- 
works HI, B21, is addressed by the Distributed Co- 
ordination Function (DCF), which is a Medium Ac- 
cess Control (MAC) protocol based on the paradigm 
of carrier sense multiple access with collision avoid- 
ance (CSMA/CA). The basic idea of the protocol is 
very simple: sensing the channel before transmit- 
ting, and waiting for a random backoff time when 
the channel is sensed busy. This random delay, intro- 
duced for preventing collisions among waiting sta- 
tions, is slotted for efficiency reason and extracted 
in a range called contention window. Standard DCF 
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assumes that the contention window is set to a 
minimum value (CW m i n ) at the first transmission 
attempt and is doubled up to a maximum value 
(CW max ) after each transmission failure. 

The distributed DCF protocol is in principle fair, 
because the contention window settings CW m ; m and 
CW max are homogeneous among the stations, thus 
ensuring that each node receives in long term the 
same number of access opportunities. Nevertheless, 
some unexpected behaviors have been recognized 
as a consequence of non-standard settings of the 
contention windows. In fact, stations employing 
lower contention windows gain probabilistically a 
higher number of transmission opportunities, at the 
expense of compliant stations. These settings can be 
performed by the card manufacturers, as recognized 
in 0, or by the end users thanks to the availability 
of open- source drivers. 

Another problem specific to infrastructure net- 
works is given by the repartition between uplink 
and downlink resources. Infrastructure networks are 
characterized by a star topology, which connects 
multiple mobile nodes to a common station called 
Access Point (AP). On one side, mobile stations 
can upload traffic to the AP, which is connected to 
external networks (e.g. to the internet); on the other 
side, they can download traffic from the external 
networks through the AP. Since the AP contends as 
a normal station to the channel, its channel access 
probability is the same of other mobile stations. 
This implies that the AP aggregated throughput, i.e. 
the downlink bandwidth, is equal to the throughput 
perceived by all the other stations, thus resulting 
in a per-station downlink bandwidth much lower 
than the uplink one [4J. Indeed, recent extensions 
of DCF (namely, the EDCA protocol) allow the 
AP to set heterogeneous contention windows among 
the stations to give priority to downlink throughput 
or to delay- sensitive traffic. Thus, nowadays nodes 
can adapt their contention windows according to the 
values signaled by the AP for each traffic class. 
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However, there is the risk to exploit this adapta- 
tion in a selfish manner, for example by using a 
contention window value of an higher priority class 

m. 

These considerations motivate a game theoretical 
analysis of DCF, in order to propose some protocol 
extensions able to cope with the current resource 
sharing problems. The problem can be formulated 
as a non cooperative game, in which the contending 
stations act as the players of the game. When sta- 
tions work in saturation conditions, i.e. they always 
have a packet available in the transmission buffer, 
DCF can be modeled as a slotted access protocol, 
while station behavior can be summarized in terms 
of per-slot access probability 0. 

Let Tj be the per-slot access probability repre- 
senting the access strategy of a generic station i. 
The channel access game can be formulated by 
considering: n players, the set of strategies r = 
(ri,i = 1, . . . ,n) in [0, 1]™, and the station payoff 
( Ji, J 2 , . . . , J n ), that can be defined according to 
the network and application scenario flU. Previous 
studies have mainly considered that such a utility is 
given by the node upload throughput performance 
O. In [[TOll , it has been shown that a utility func- 
tion equal to the upload throughput may lead to 
an inefficient Nash equilibrium in which stations 
transmit in every channel slot (i.e. play r = 1). 
This situation creates a resource collapse, because 
all stations transmit simultaneously thus destroy- 
ing all packet transmissions. More complex utility 
functions combining upload throughput and costs 
related to collision rates ifTOll . [fTT]|. [fT2| or to energy 
consumptions lfl"3ll . |[T4| lead to different equilibria, 
but they implicitly assume that all the nodes have 
the same energy constraints or collision costs. Other 
utility functions, which lead to efficient equilibria, 
appear less natural lfl"5l . because they are not related 
to physical parameters, such as throughput. 

In this paper, we show that efficient equilibria 
conditions can be naturally reached when stations 
are interested in both upload and download traffic. 
Since the utility of each station depends not only 
on its throughput but also on the AP throughput, 
no station is motivated to transmit continuously. 
We propose a game theoretic analysis of DCF in 
infrastructure networks, when all the stations have a 
desired ratio between uplink and downlink through- 
put. Assuming that each station tunes its access 
probability according to a best response strategy, 



extending our preliminary results in |[T6l and in 
IfPTll . we derive Nash equilibria and Pareto optimal 
conditions as a function of the network scenario. 
We also define a mechanism design scheme, in 
which the AP plays the role of arbitrator to im- 
prove the global performance of the network, by 
forcing desired equlibria conditions. We propose to 
extend current DCF operation by implementing our 
theoretical best response strategies. To this purpose, 
we develop some channel monitoring functionalities 
(similarly to lfT8l . Ifl9l0 . devised to estimate the 
network status and to run-time drive the strategy 
adaptations. Finally, we evaluate the effectiveness 
of our scheme by means of extensive simulations. 

The rest of the paper is organized as follows. 
In section [n] we carry the game theoretic analysis 
and we find the Nash equilibria and the Pareto 



Optimal solutions; in section III we design the 
DCF extension; in section [IV] we show the MAC 
scheme implementation and the performance eval- 
uation trough simulations; finally we drew some 
conclusive remarks in section |V] 

II. Contention-based channel access: a 

GAME THEORETIC ANALYSIS 

We assume that all the stations try permanently 
to transmit on the channel, i.e. they work in sat- 
uration conditions. In fact, we have verified that 
non- saturated stations affect the performance of 
saturated stations only marginally and regardless 
of their contention windows. When all stations are 
saturated, it has been shown [20] that DCF can 
be accurately approximated as a persistent slotted 
access protocol, because packet transmissions can 
be originated only at given time instants. Figure [T] 
shows an example of DCF as a slotted protocol. 
After a busy time, stations A and B defer their 
transmissions by extracting a random slotted delay 
(respectively, 4 and 8 slots). Since station A expires 
its backoff first, it acquires the right to transmit on 
the channel. The next transmission, which results 
in a collision, is performed again after an integer 
number of backoff slots from the end of the previous 
channel activity. Therefore, the channel time can be 
divided into slots of uneven duration delimited by 
potential transmission instants. In a generic channel 
slot, each station has approximately a fixed proba- 
bility r to transmit (in the example, 1/5 for station 
A and 1/10 for station B)), which depends on the 
average backoff values. 
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A. Station strategies 



where /() has been derived in (0): 



2(i-p r+1 ) < o < 1 



Let n be the number of saturated contending sta- 
tions. We assume that each station i is rational, and 
can arbitrarily choose its channel access probability 
Ti in [0, 1]. This choice can be readily implemented 
by tuning opportunistically the minimum and the 
maximum values of the contention windows. By 
observing that r* = 1/(1 + E[W}/2), where E[W] 
is the average contention window used by station, a 
solution is to set CW l min = CW l max = 2/n - 2. 

The set of all the strategies in the network is then 
[0, We define an outcome of the game a specific 
set of strategies taken by the players, then a vector 
r = (ti,T2, ••• , r n ) G [0, l] n . We define that an 
outcome is homogeneous whenever all the stations 
play the same strategy, i.e. r = (r, r, ...r). 

Being all the player requirements homogeneous, 
the equilibria we are going to define are invariant 
to player permutations. For this reason we define 
two outcomes r a and r b equivalent if they can be 
obtained one from the other through an opportune 
permutation of the indexes and we write r a ~ r b . 
We denote a class of equivalent outcomes as {r}, 
where f is an ordered vector, i.e. a vector with 
increasing component (fi < f% < ■ ■ ■ f n ). If A is a 
set of ordered vectors, then {A} denotes the union 
of the classes of equivalence of the vectors in A. 

Performance perceived by a given station i not 
only depends on the probability to access the 
channel, but also on the probability that no other 
station interferes on the same slot. Therefore, from 
the point of view of station i, the vector strategy r 
can be represented by the couple of values {r^pA, 
where p± — 1 — [^^(1 — Tj), the probability that 
at least another station transmits, summarizes the 
interactions with all the other mobile stations. 

In presence of downlink traffic, we also assume 
that the AP contends for the channel as a legacy 
DCF station with saturated downlink traffic. Thus, 
the overall collision probability suffered by station i 
results 1— (1— Pi)(l— tap), where tap is the channel 
access probability employed by the AP. Since the 
AP is a legacy station, its transmission probability 
is not chosen by the AP, but is function of the 
perceived collision probability p A p, r A p = f(pAp), 



2(R+1) 



i+£f=o w(i) 



P 



(1) 

where R is the retry limit employed in the network 
and W(i) is the contention window at the i th 
retry stage (i.e. W{i) = mm{2 i CW min , CW max }). 
We can evaluate the AP collision probability as a 
function of the vector strategy r or as a function of 
a generic couple (r^Pi): 



Pap 



n< 

i=i 



Ti 



(l- Pi )(l-0 



B. Station Utility 

According to the slotted channel scale, the ran- 
dom access process can be described as a sequence 
of slots resulting in a successful transmission (when 
only one station accesses the channel), in a collision 
(when two or more stations access the channel), 
or in an idle slot (when no station accesses the 
channel). By observing that each slot boundary 
represents a regeneration instant ll2Tll for the access 
process, the throughput of each station can be 
readily evaluated as the ratio between the average 
number of bits transmitted in each slot and the 
average duration of each slot 112011 . 

Assuming that the AP equally shares the down- 
link throughput among the stations, we can express 
the uplink throughput S l u and the downlink through- 
put for the i-th station as HI: 



Ji,Pi) 



T i (l-p i )(l~T A p)P 
Pidle® + [1 — Pidle\T 

1 T A p{l~PAp)P 
U PidleO- + [1 - Pidle]T 



(2) 



(3) 



where P is the frame payload which is assumed to 
be fixed, a and T are, respectively, the empty and 
the busy slot duration^ and Pidie is the probability 
that neither the stations, nor the AP transmit on the 
channel, i.e. P idie = (1 - p AP )(l - t ap ). 

Since the downlink throughput is equal for all the 
stations, we can avoid the i superscript in ([3]). We 

'We are implicitly considering a basic access scheme, with 
EIFS=ACK_Timeout+DIFS, which corresponds to have a fixed busy 
slot duration in both the cases of successful transmission and colli- 
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define the utility function Jj for the mobile station 

i as: 

J t = min{SlkS d } (4) 

The rationale of this definition is the assumption 
that the station applications can require bandwidth 
on both directions. The coefficient k £ [0, oo) takes 
into account the desired ratio between the uplink 
and the downlink throughput. We assume that the 
application is the same for all the stations, thus using 
a fixed k value for all the utility functions. When 
k = 1 all the stations require the same throughput 
in both directions. Note that k = oo corresponds 
to a unidirectional traffic case, in which stations 
are not interested in downlink throughput and their 
utility is simply given by the uplink throughput (as 
assumed in most previous literature). Figure [2] plots 
the utility of a given station i for k = 1, in case of 
802.11b physical (PHY) layer, P = 1500 bytes, a 
data rate equal to 1 1 Mbps, and an acknowledgment 
rate of 1 Mbps. In such a scenario, by including 
physical preambles, acknowledgment transmissions, 
MAC headers and interframe times, the T duration 
is equal to 1667 fxs. Different network conditions, 
summarized by different values of the pi probability, 
have been considered. The collision probability Pi 
takes into account only the competing mobile sta- 
tions, so that the actual collision probability is given 
by 1 - (1 -Pi)(l - T AP (pi, n)). From the figure, it 
is evident that, for each pi, the utility is maximized 
for a given best response value (e.g. about 0.01 for 
p = 0.15), which slightly decreases as pi grows. 
We also consider the single variable functions 



S 



horn i 



r) = S u (r, 1 - (1 - r 



and S. 



horn I 



S d {r, (1 — r)" ) representing, respectively, the up- 
link and downlink throughput perceived by each 
station in case of homogeneous outcomes (t|ti = 
r,Vz). Figure [3] plots the utility of a given station 
in case of homogeneous outcomes for n = 2 and 
n = 10, and for different k values. In these curves 



all stations achieve an utility value equal to can 
be easily found by observing that: 

Remark 2.1: In general, station i utility is a func- 
tion of the whole set of strategies (r), but it is 
constant and equal to if a) pi = 1, i.e. if at 
least one of the other players is transmitting with 
probability 1 (3j ^ i \ Tj = 1), or if b) T{ = 0. 
The AP access probability tap depends on T{ and 
Pi according to ([T|) and cannot be equal to 1 for 
standard contention window values. 

Proposition 2.1: The vectors of strategies r, 
such that 3 i, j E 1,2, ■■■ n \ Ti = I, Tj = I are NE 
of the distributed access game in which all stations 
achieve an utility value equal to 0. 

Proof: The result is an immediate consequence 
of Remark 12.11 If there are at least two stations 
transmitting with probability 1, then the channel is 
entirely wasted because of collisions and S l u = S d = 
0, VZ. In these conditions, J\ = V/ and stations are 
not motivated in changing their strategies. The equi- 
librium strategies are then {(x, 1, 1), a: £ [0, l] n_2 }. 

■ 

The following remark will be useful for charac- 
terizing more efficient NE. 

Remark 2.2: Consider a generic station % and the 
collision probability p { £ (0, 1) suffered because of 
the other station strategies. By derivation, it can be 
easily proved that Sd{ji,p-i) is a monotonic decreas- 
ing function of r i5 starting from S d (0,pi) > 0, and 
that S l u (Ti,pi) is a monotonic increasing function of 
Tj, starting from S l u (0,pi) = 0. 

Let us consider now the best response strategy 
of a station i and denote it by t^. For k = 
oo, the station utility fun ction is equa l only to 

s: 



From Remarks 2.1 and 



2.2 

(SrT 



that the utility is maximized for t- 
Pi < 1, and it is constant to when pi 



\n— 1 



it results 
1 when 
= 1. For 

k < oo, from Remark |2.2| we can state that the 
utility Jj is maximized for t^ £ (0, 1) such that 

S^ br) , Pl ) = kS d ^ br) , Pl ). It follows that t^ is 



is not fixed, because the strategy the solution of the following implicit equation: 



Pi = 1-U 

changes are not unilateral. The optimal strategy, 
which maximizes the station utility, is function of 
both n and k. 



C. Nash Equilibria 

We are interested in characterizing Nash Equil- 
bria (NE) of our game model where stations achieve 
a non-null utility. The inefficient equilibria in which 



(br) 



n—(n—k)TAP 

fc/(l-(l-Pi)(l-7f r) )) 

n-(n-fc)/(l-(l- Pi ) (l-rf r) ) .) 



(5) 



It can be shown that the previous equation has a 
single solution r* in the range (0, 1), which can be 
numerically solved in a few fixed point iterations. 

Proposition 2.2: In case of unidirectional traffic 
(i.e. k = oo), the NE of the distributed access game 
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with non-null utility values are all and only the vec- 
tor of strategies r, such that 3\ i E 1,2, ■■■ n \ r,i = 
1. 

Proof: Being i the index of the single sta- 
tion transmitting with probability 1, it follows that 
Pidie = and pi 7^ 1. Therefore, station i achieves a 
non-null uplink rate equal to S l u = P(l — Pi)/T = 
Ji. All the other stations achieve Si = for 
j % regardless of their strategy Tj. The vectors 
of strategies {(x,l),x E [0, l) n_1 } are then Nash 
Equilibria. Any other vector of strategies r, such 
that Tj 7^ IVi cannot be a NE, since for Pi ^ 1, a 
generic station i can improve its utility by increasing 
its strategy up to t { = 1. ■ 

The vectors of strategies {{x,l),x E [0, 
are not NE for k < oo. In fact, being p^p = 1 — 
EK=i(l — T h) — h f° r these vectors of strategies 
the downlink throughput Sd is equal to 0. Thus, the 
single station i playing Tj = 1 is motivated to reduce 
its transmission probability to a value lower than 1, 
in order to increase its utility Jj = minis'* , Sd}. 

Proposition 2.3: In case of bidirectional traffic 
(i.e. k < oo), the homogeneous strategy vector 

,Vz) is the only 



IT Ti 



T 



fc/(l-(l-T*)") 



n-(n-k)f(l-(l~r*) n ) • 

NE with non-null utility values of the distributed 
access game. 

Proof: The strategy vector (t*,t*,--- , t*) is 
a NE as an immediate consequence of ([5]) for 
Pi = 1 — (1 — r*) n_1 . In fact the equation can 
be read as a mutual best response. Since equation 
Q has a single solution, there exists a unique 
homogeneous NE strategy. Since (1 — pj)(l — Tj) = 
(1 — Pj)(l — Tj),Vi,j, the right hand of the best 
response equation ([5]) is the same for all the stations. 
This excludes the existence of non-homogeneous 
Nash equilibria. ■ 
The parameter r*, which characterizes the NE 
strategy with non-null utility only depends on the 
number of stations n and desired ratio k between 
uplink and downlink throughput. It is not affected 
either by the PHY layer parameters (such as backoff 
slot duration, interframe spaces, etc.) or by the 
frame length. 



D. Social utility 

In this section, we try to identify desirable out- 
comes from a global point of view. A natural choice 
is to look at outcomes that maximize a global utility 



function, such as the minimum utility Js(r) per- 
ceived in the network: min Jj. This global utility 

i=l"-n 

is often referred to as social utility. The following 
remark will be useful for such a characterization. 

Remark 2.3: The uplink throughput S^ om (r) per- 
ceived in case of homogeneous outcomes is a non- 
monotonic function in r, with a single maximum 
value ^(rj, for r x E (0,1). 

Proposition 2.4: The social utility is maximized 
for an homogeneous outcome (r', t', • • • , r') and 
such outcome is Pareto Optimal. 

Proof: From the utility definition, we have that 
the minimum utility perceived in the network is 
given by min{ min S l u ,kSd}. Therefore, the min- 

i=l, ---n 

imum utility can be due to the minimum uplink 
throughput or downlink throughput. 

We observe that the minimum uplink throughput 
is maximized for a homogeneous outcome. In fact, 
if r is a non-homogeneous outcome in A = {x, x E 
(0, l) n }, we can prove that the minimum uplink 
throughput cannot be maximized. Without loss of 
generality, we consider that r is an ordered vector, 
with < 7~i < t 2 < ■ ■ ■ < r n < 1 and 5„(ri,pi) = 
minj S l u (Ti,pi). Let r[ be a new strategy for station 
1, such that < T\ < r[ < r 2 . For the new outcome 
t' = (r[, t 2 , ■ ■ ■ r n ), the minimum uplink throughput 
is still the throughput perceived by station 1. This 
throughput is higher than the previous one, since 
S l u (ri,pi) is monotonic increasing in Tj. It follows 



that SI(t[, Pi ) 



min i S';(r i ,p-) > S£(7i,pi) = 
minj S^ {t^ pi). Therefore, r has to be homoge- 
neous. 

We also observe that when all the station utilities 
are limited by the downlink throughput, i.e. S % u > 
kSd Vi, Sd is maximized for an homogeneous 
outcome. In fact, a non-homogeneous outcome can- 
not maximize Sd, because it is always possible to 
increase the AP throughput, by slightly reducing a 
given channel access probability Tj while maintain- 
ing Si > kSd (i.e. while maintaining r, > t^). 

We can conclude that the social utility is max- 
imized for an homogeneous outcome. For homo- 
geneous outcomes {t\t^ = r, Vi), the social utility 
r) is a single variable function and can be 



Thorn 
J S 



expressed as min{S , ^ om (r), fcS^ om (r)}. Being r* 
the homogeneous NE strategy for which S^ om {r) = 



kS^ om [ 
in remark 2.3 
with t' 



t) , and t x the homogeneous strategy defined 
J^ om (r) has a single maximum in r', 



s 

for t* 



< t x , or t = t x when t x < t*. 
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Therefore, Jg orn {r') is the maximum social utility 
achievable in the network. 

Finally, we prove the Pareto optimality. We recall 
that a Pareto optimal outcome is one such that 
no-one could be made better off by changing the 
vector of strategies without making someone else 
worse off. Since the homogeneous strategy vector r' 
maximizes the minimum utility perceived in the net- 
work, any other vector of strategies different from 
t' will degrade the utility of the station perceiving 
the minimum performance. Let r be an outcome 
different from r', with at least a player better 
off than in t' . This allocation is necessarily non- 
homogeneous because, for homogeneous outcomes, 
the maximum utility is reached in r' . Since for 
non-homogeneous outcomes the minimum utility is 
lower than Jg om (-r'), it exists a station j (perceiving 
the minimum utility) whose payoff is lower at r 
than at t'. It follows that t' is a Pareto outcome. 

■ 

Figure |3] shows some examples of J hom (r), where 
the maximum utility value J^ om (r') is limited by the 
uplink throughput (i.e. r* > r x ) or by the downlink 
one (i.e. r* < t x ). Note that the intersection between 
the function S£ om (r) and the function fc^ om (r) 
depends on k. Figure [3] shows that the intersection 
strategy r*, for which the utility function has an 
abrupt slope change, grows as the k value increases. 
Let k x be the value of k for which r* = r x . For 
k > k x , j£ om (r') is equal to ^ om (r') and results 
maximized. 

Proposition 2.5: If the solution r* of ([5]) for pi = 
1 — (1 — r*) n-1 is lower or equal to r x , the NE 
(r*, r*, • • • r*) is Pareto optimal. 

Proof: For r* < t x , the NE is the homogeneous 
outcome that maximizes the minimum network util- 
ity. Therefore, from the previous preposition, it is 
also Pareto optimal. ■ 

Figure [3] shows that the limit condition r* = t x 
is approximately reached for k x = 20 in case of 
n = 2, and for k x = 11 in case of n = 10. 
For smaller k values, the homogeneous NE r* is 
Pareto optimal. For larger k values, including the 
unidirectional traffic case k = 00, the Pareto optimal 
outcome t' is not an equilibrium point and the NE 
r* gives poor performance (i.e. performance much 
worse than j£ om (r')). 



III. Channel access mechanism design 

Our previous considerations about equilibrium 
conditions and Pareto optimality have shown that 
the global performance of the distributed access 
scheme strongly depends on the desired ratio k be- 
tween downlink and uplink throughput. In fact, the 
station utility perceived in equilibrium conditions 
and the maximum social utility depend on k, i.e. 
on the intersection point between the curves S^° m 
and kS^ om . However, such a result is based on 
the assumption that the Access Point behaves as 
a legacy station. In this section, we explore the 
possibility to use the Access Point for changing 
the Sd or S u functions, in order to force desired 
equilibrium outcomes. Indeed, since the AP plays 
the role of gateway to external networks, it can also 
play the role of arbitrator for improving the global 
performance of its access network. 

A. Tuning of the AP channel access probability 

A first solution for changing the uplink and 
downlink throughput curves is to use the AP channel 
access probability tap as a configuration parameter. 
Then, tap does not depend on r according to ([T|), 
but it is equal to a fixed value, which can be tuned 
by the AP. The best response ([5]) for all the stations 
is equal to 



n — [n — k)tap 

and the NE in (0, l) n becomes (r + , r + , ■ ■ • t + ). 

Figures [4] and [5] show the effects of the tap tuning 
on the utility perceived for homogeneous outcomes. 
We considered a scenario with 10 contending sta- 
tions, a packet size of 1500 byte, and k = 1. Each 
labeled curve refers to a different tap setting, as 
indicated in the legend. For each curve, the NE 
corresponds to the cuspid point. For comparison, 
the figures also plot a bold dashed curve for the 
case in which the AP behaves as a legacy station. 
Figure [4] has been obtained for an 802.11b PHY 
layer at the maximum transmission rate (namely, 
11 Mbps). In this case, the utility perceived with a 
legacy AP at the NE is about the same perceived 
with a fixed tap = 0.064. However, when the 
packet transmission time is shorter, it is possible to 
improve the NE utility by opportunistically tuning a 
fixed tap value. Figure [5] has been obtained for an 
802.1 In PHY layer at the maximum rate (namely, 
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600 Mbps). In this case, the difference between the 
NE utility perceived under a legacy AP and under 
a fixed T A p = 0.168 is about 0.9 Mbps. 

Since each different tap setting leads to a dif- 
ferent homogeneous NE, given a desired NE it 
is possible to design the corresponding tap value 
by inverting We can express the utility J NE 
perceived at the NE as a function of the desired 
homogeneous NE (r, r, ■ ■ ■ r). By considering that 
at the NE the utility can be expressed as S^ om , (or 
equivalently as kS^ om ), after some manipulations it 
results: 



J ne (t) 



rfl - tYP 



T - (1 - r^+^T - a) + ^Tt 



(7) 



Figure [6] plots some examples of the NE utilities 
J NE achievable for n — 10, P — 1500 bytes, and 
different k values, in the case of 802.11b PHY at 
11 Mbps. By deriving ffi), it can be shown that, for 
k 7^ 0, the function J N has a unique maximum in 
t E (0, 1). Such desired maximum can be obtained 
by setting a specific tap d value. Although an exact 
computation of tap q is in principle possible, by 
deriving J NE and inverting an approximated 
optimal tuning can be derived as follows. In 0, 
it is shown that for a network with N competing 
stations the optimal channel access probability is 



given by 



In our scenario, at the NE out- 



Ny/T/2a' 

come, the AP downlink throughput corresponds to 
the aggregation of n flows, whose bandwidth is a 
fraction 1 jk of the uplink throughput perceived by 
each station. Therefore, we can consider the AP 
channel access probability as the aggregation of n/k 
flows in a network with n+n/k contending stations. 
It follows: 



n 



T AP a 



k (n + n/k) JT '/2a (n + kn)JT/2a 

(8) 

which leads to the desired NE outcome 

k 



(kn + n)^/T/2a - (n - k)' 

Figure [6] visualizes the accuracy of the proposed 
approximation by plotting the points (f , J NE (f )) 
(white boxes) for different k values. The proposed 
approximation has an important practical implica- 
tion. In fact, by simply specifying the application 
requirements k and estimating the number of con- 



tending stations n (as described in section IV), the 



access point can tune its channel access probability 
to tap q , thus forcing the network to the desired 
equilibrium point. We also report the NE utilities 
perceived under legacy AP (black boxes). Compar- 
ing white and black boxes, it is evident that in many 
cases the network performance under a legacy AP 
are suboptimal (i.e. the NE utility is lower than the 
maximum of the J NE curves. 

The figure also plots the limit curve obtained 
for k — > oo, which represents the system behav- 
ior when the application requirements tend to the 
unidirectional traffic case. In this case, the J NE 
expression given by [7J tends to the uplink through- 
put expression. However, this curve is practically 
unfeasible because for k — > oo tap tends to zero 
for any desired NE, and cannot be used as a tuning 
parameter. In other words, although f tends to the 
finite value 1/ [n^jT /2a + 1) maximizing the social 
utility, the mechanism design cannot be practically 
performed. When T A p ^ 0, such as in the legacy 
case, for k — > oo the best response of each station 
tends to 1 (as shown in the black boxes of figure 



B. ACK suppression 

The mechanism design described in the previous 
section is unfeasible when k = oo. Moreover, when 
k is very high (i.e. tap tends to zero), stations 
need a long estimation time for correctly evalu- 
ate their best response. A solution for controlling 
the resource repartition in infrastructure networks 
with negligible (or zero) downlink throughput is a 
selective discard of the ACK transmissions at the 
AP side. Since the AP is the common receiver 
for all stations, suppressing the ACKs at the AP 
side corresponds to triggering ACK timeouts at the 
station side, which are interpreted as collisions. 
Therefore, the ACK dropping can act as a punish- 
ment strategy devised to limit the uplink throughput 
of too aggressive stations. We propose the following 
threshold scheme: if a generic station i has an access 
probability higher than the a given value 7, the AP 
drops an ACK frame transmission with probability 
min{a(ri - 7), 1}. 

In this case, by considering T A p = (i.e. the 
unidirectional traffic case) or tap — 0, the utility 
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function J, of a given station % can be expressed as: 



< Ti < 7 



-Pid!eO'+[l-'PidI e ]r 

/•fr- 7) A = I n(l-Pi)[l-Q(Tj-7)] _, _ ^ „ i 1 /„ 

^ P ldle a + [l-P me ]T 7<Ti<7+l/« 
7 + l/«<r,<l 

(9) 

where we recall that = (1 — Ti)(l — pi). 

According to the previous expression, for 7$ < 7 
the utility function Jj is an increasing function of 
while for Tj < 7 its slope depends on the a 
setting. By selecting an a value which corresponds 
to a negative derivative for 7 < Ti < 7 + 1/a, the 
utility function is maximized for = 7. 

Figure [7] plots the station utility perceived in case 
of homogeneous outcomes, under the ACK suppres- 
sion scheme, for n=10 and different a values. For 
a = 0, the station utility is simply given by the 
uplink throughput. In this case, the utility is an 
increasing function of the channel access probability 
and the best response of each station is = 1. 
For a > 0, the utility function is maximized for 
T (br) < 1 Such a maximum corresponds to 7 for 
large enough values of a (in figure, a = 80). 

We can then prove the following result. 

Proposition 3.1: The outcome (t'Itj = r', Vi) is 
a Pareto optimal Nash equilibrium of the game, 
when the ACK suppression scheme indicated above 
is implemented with 



7 = r 
a > 



t'(1 + t'(-1 + 



r 



T-{T-s)(1-t>) 



l_ T /\n-l)) 



• (10) 



Proof: First we observe that t' is a NE. In fact, 
whatever player we consider, say it player i, Re- 
mark 2.1 guarantees that for r< < r' Jj decreases as 



Tj decreases. For t' < Ti < r' +l/a inequality (10) 



guarantees that Jj decreases as Tj increases until it 
does not reach the value 0. For r > r' + 1/a, the 
punishment strategy implies Jj = 0. Then deviating 
from t' is not convenient for player i. 

Second, r' is the unique point of maximum for 



the social utility J, 



hom I 



In fact, the social utility 



is given by the minimum utility perceived in the 
network, which is given by S^ om (T) in the absence 
of punishment. The punishment strategy can only 
decrease the social utility for r G [0, l] n — [0, r'] n 
and keeps it unchanged otherwise. Therefore, the 



preposition 3.1 holds also under the punishment 
strategy and the outcome r' is Pareto optimal. ■ 



The utility J NE perceived at the NE can be 
simply expressed as the uplink throughput perceived 
in case of homogeneous outcome (t|tj = rVi): 



J 



NE I 



"(1 



>P 



(ID 



[1 - r) n a + (1 - (1 - r) n )T 

In it is shown that such a function has a unique 
maximum for r E (0, 1), that can be approximated 

1 

(12) 



n 



Therefore, by estimating the number of contending 
stations n, each AP can implement (as described in 
section [TV]) an ACK suppression scheme that forces 



the system to work on the NE r G maximizing the 
network throughput. 

IV. Game-based MAC scheme: 
Implementation and Evaluation 

On the basis of the results discussed in the 
previous sections, we propose some simple DCF 
extensions devised to enable each contending sta- 
tion to dynamically tune its channel access prob- 
ability according to a best response strategy. For 
this purpose, each station needs two estimators for 
probing the uplink and downlink load conditions. In 
fact, station best response depends not only on the 
application requirements (by means of k), but also 
on the uplink load (by means of n) and downlink 
load (by means of tap)- 

We consider both the case in which the AP 
behaves as a legacy station, and the case in which 
the AP acts as a game designer, for forcing de- 
sired equilibrium conditions. In this second case, 
we assume that also the AP is able to estimate 
the uplink load (by means of n) and the channel 
access probability employed by each station. 
These estimates are then used for opportunistically 
tuning the AP channel access probability or the 
ACK suppression scheme (by means of 7 and a). 

A. Network load estimators 

In contention-based networks, network load can 
be simply related to channel observations. Consid- 
ering the slotted channel model due to saturation 
conditions, a channel observation corresponds to the 
channel outcome observed into a given slot. Such 
outcome is given by an idle slot when no station 
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transmits, by a successful slot when a single station 
transmits, by a collision slot when two or more 
stations transmit simultaneously. In order to perform 
run-time estimators, the channel observations can 
be grouped in regular observation intervals at which 
new measurement samples are available. We express 
the measurement intervals in terms of an integer 
number B of channel slots. Since slot size is uneven 
(because successful and collisions slots last for a T 
time, while idle slots last only for a), the actual 
time required for a new measurement sample is not 
fixed. 

For measuring the number of stations actually 
contending on the network, we propose to count the 
number of different transmitters observed during the 
measurement interval 0. Each transmitter can be 
identified by means of its MAC address. Obviously, 
the monitoring station can identify the transmitter 
address only when the packet is received correctly. 
Thus, within B observation slots, the number of 
successful packets is not fixed. Let n m {t) be the 
uplink load measurement performed during the t- 
th measurement interval by a given station. The 
estimation n of the number of contending stations 
is then performed as: 



n(t) = Sn{t - 1) + (1 - S)n m (t) 



(13) 



where 5 is the filter memory. 

The filter performance is critically affected by 
the time window B. Indeed, n m is a cumulative 
parameter, which does not allow to identify all the 
different transmitters summed up to n m . Therefore, 
the filter memory is not able to correct a par- 
tial counting of the actual number of contending 
stations during B. In other words, B has to be 
carefully tuned, in order to guarantee a reasonable 
probability to catch all the contending stations in 
each measurement interval. Consider for example 
the case in which n = 4 and B is set to a value 
able to include only two successful transmissions. 
Assuming that stations 1 and 3 transmit during the 
first measurement interval, stations 2 and 4 during 
the second one, and so on, the filter will consider 
n m (0) = n m (l) = • • -n m (t) = 2, thus providing a 
wrong estimate of n. A solution can be an automatic 
tuning of B according to the comparison of mea- 
surements performed during different observations 
intervals (e.g. B and 2B). In our previous example, 
by counting n m during B and during 2B, we find 



n m (B,0) = 2,n m (2B,0) = 4. Since the measure- 
ment performed during 2B is higher than the one 
performed during B, the station understands that B 
has to be increased. Due to space constraints, we 
omit the details of the automatic tuning of B. 

In order to measure the channel access probability 
employed by the AP, the monitoring station has to 
count the number tx A p of successful transmissions 
performed by the AP during B. Given that there is 
no way of understanding which station has transmit- 
ted in a collision slot, the station has also to count 
the total number of collisions C for measuring the 
T Ap(t) parameter in the i-th time interval as t §z^- 
The estimation f AP is then performed as: 

f AP (t) = /3f AP (t - 1) + (1 - P)r%p(t) (14) 

where (3 is the filter memory. Note that the B setting 
is not critical for this estimator. 



B. Best response performance under legacy AP 

As discussed in section [nj in case of unidirec- 
tional traffic, the best response strategy leads to 
very poor throughput performance under legacy AP. 
Therefore, we analyze the case k = oo only in 
the next subsection, when the AP acts as a game 
designer. 

For k < oo, a generic station i may implement a 
best response strategy, on the basis of the previous 
estimators and ([5]), by setting its channel access 
probability to: 



T 



(fir) 



TAp(t) 



h(t) - (n(f) - k)r AP {t). 



(15) 



In order to evaluate the effectiveness of this 
scheme (approximating the performance of an ideal 
best response in which all stations exactly know 
the network load), we run some simulations. We 
extended the custom-made C++ simulation platform 
used in [7]. We considered an 802.1 lg physical rate, 
with the data rate set to 6Mbps. The contention 
windows used by the AP have been set to the 
legacy values CW m i n = 16 and CW max = 1024. 
All the simulation results have been obtained by 
averaging 10 different simulation experiments last- 
ing 10s, leading to a confidence interval lower than 
3%. Unless otherwise specified, the measurement 
interval B has been set to 500 channel slots (which 
averagely correspond to 300ms). Figure [8] compares 
the behavior of our scheme with standard DCF 
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Each point refers to a network scenario in which n 
stations (indicated in the x axis), with an application 
requirement k, compete on the channel with a 
legacy AP. The aggregated uplink throughput (i.e. 
the sum of the throughput perceived by all the mo- 
bile stations) and k times the aggregated downlink 
throughput, (i.e. the AP throughput) are indicated 
by the y axis, respectively by white and black 
points. From the figure, it is evident that, as the 
number of contending stations increases, standard 
DCF gives very poor performance to the downlink 
throughput. Conversely, for k — 1 our scheme is 
able to equalize uplink and downlink throughput for 
each n, and even in congested network conditions. 
Moreover, it is also able to maintain the overall 
network throughput (i.e. the sum of the aggregated 
uplink and downlink throughput) almost indepen- 
dent on the network load. For example, for n = 20 
the sum of the uplink and downlink throughout 
is about 3.8 Mbps for standard DCF and about 
5 Mbps for our scheme. The figure also proves 
our scheme effectiveness for different application 
requirements (i.e. k = 0.5). The figure clearly 
visualizes that ^ = knSd as expected. We 
also considered our scheme performance in time- 
varying load conditions, by running several simula- 
tion experiments in which the number of contending 
stations dynamically changes during the simulation 
time. Figure [9] shows a simulation example lasting 
300 seconds, in which we start with 5 contending 
stations, add 5 more stations after 100 seconds and 
switch 3 stations off at 200 seconds. In the figure we 
plot the throughput perceived by a given reference 
station (namely, station labeled as station 1) and 
the aggregated throughput perceived by the AP. 
The figure also plots the number of active stations 
estimated by station 1 on the basis of the load filter 
defined in (fl3l, for 5 = 0.7 and B = 500 slots. We 



can draw some interesting observations. First, the 
average AP throughput is basically independent on 
the number of stations contending in the network. 
This behavior is very different from the standard 
DCF behavior, according to which the AP behaves 
as a normal station. Second, the station throughput 
is approximately equal to the desired ratio, i.e. to 
1/n the downlink throughput. 

Note also that our scheme is different from a 
classical prioritization scheme, such as the schemes 
defined in the EDCA extensions. Indeed, by giving 
lower contention windows to the AP (i.e. an higher 



EDCA priority class to the AP), it is not possible 
to perform a desired resource repartition between 
uplink and downlink which is also load independent. 
Finally, the load estimator gives quite stable results, 
because the number of different stations listened in 
each temporal window B is always equal to the 
total number of contending stations. We can see that 
only in the interval between 100 and 200 seconds, 
during which we have a number of contending 
stations equal to 10, the estimator gives a number 
of contending stations lower than the actual one 
for two times. In fact, since we are monitoring a 
fixed number of slots, as the number of stations 
in the network grows, it is more likely that in 
some intervals some stations do not succeed in 
transmitting on the channel. 

C. Best response performance under AP mechanism 
design 

In order to enable the AP to play the role of 
game designer, it is necessary to equip this central 
node with different estimation functions. In case of 
bidirectional traffic, the AP has to estimate the num- 
ber of competing stations n for tuning its channel 
access probability to the optimal value given by 
([8]). For this purpose, the same estimator defined 



in (IV-A) can be used at the AP side. We assume 



that the application requirement k does not need 
to be estimated, because it is known a priori (e.g. 
by means of signaling messages). In turns, each 
contending station runs an estimator of the AP 
channel access probability in order to perform a best 
response strategy adjustment according to ([6]). 

Figure [10] shows an example of uplink and down- 
link throughput repartition when the AP implements 
a mechanism design scheme. The simulation refers 
to an experiment lasting 110 seconds, with 10 
competing stations, a packet pay load of 1500 byte, 
an 802.11 PHY layer at 11 Mbps, and k = 0.5. 
For comparison, the figure also plots the throughput 
repartition perceived under a legacy AP. The figure 
shows that the mechanism design implementation 
is effective in improving the downlink and uplink 
throughput performance. Such an improvement can 
be obtained despite of the simplicity of the em- 
ployed load estimators. 

When k — > oo, the implementation of the ACK 
suppression scheme requires that the AP evaluates: 
i) the 7 threshold, which depends on the estimate of 
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n; ii) the a coefficient, which is simply related to 
7 and to the PHY parameters T and a; iii) the per- 
station channel access probability n, i — 1, 2, • • • n, 
which require n estimators similar to the one used 
by the stations for evaluating tap- During the ob- 
servation interval B, the AP has to separately count 
the successful transmissions tx,i performed by each 
station i, and the number of collisions C, for mea- 
suring r 4 m as tXi/(B — C). These measurements can 
be filtered with the usual auto-regressive filter. The 
implementation of the ACK suppression scheme at 
the AP side has important implications for prevent- 
ing users and card manufacturers from using non- 
standard contention window values. As proved in 
0, currently there is an impressive proliferation of 
cheating cards, i.e. cards which implement lower 
contention windows for taking advantage during the 
contention with other cards. 



Figure 1 1 shows a simulation example (reproduc- 
ing one of the realistic scenarios documented in [3J), 
in which a cheater card with a contention window 
equal to 8 compete with one legacy card. The figure 
refers to a simulation experiment lasting 105 sec- 
onds, after a transient phase of 10 seconds. Despite 
of the temporal fluctuations, it is evident that the 
cheating card obtains a throughput (dashed line) 
higher than two times the throughput (bold line) per- 



ceived by the legacy card. Figure [TT] also plots the 
throughput performance of the two stations when 
the AP implement the ACK suppression scheme. In 
this case, the cheater is no longer motivated to use a 
channel access probability higher than the contend- 
ing station, since its throughput is maximized tuning 
the channel access probability to the threshold value 
1/2a/T/2ct (i.e. implementing a best response strat- 
egy). The figure shows the throughput performance 
of the two stations implementing the best response 
(bold and dashed lines labeled as best response), and 
the throughput degradation perceived by the cheater 
station by still using a contention window equal to 8. 
The ACK suppression scheme works properly, even 
if the AP relies on the channel access probability 
estimators, rather than on the actual values. 

In order to asses the effectiveness of the ACK 
suppression implementation as the number of com- 



peting stations grows, figure 12 compares the ag- 
gregated network throughput of our scheme with 
the standard DCF one, for different n values. Each 
point refers to a network scenario in which n 
stations (indicated in the x axis) are aware of the 



ACK suppression risk and employ a consequent best 
response strategy. Although the variance of the 
estimators could imply that in some intervals such a 
probability passes the threshold value (i.e. there is a 
non null probability of unnecessary ACK dropping), 
the figure shows that the aggregated throughput 
is almost constant regardless of the number of 
competing stations. This behavior is very different 
from standard DCF, whose efficiency depends on 
the number of contending stations and degrades for 
high load conditions. Therefore, our scheme is able 
not only to discourage cheating card behaviors, but 
also to optimize the global network performance. 

V. Conclusions 

The proliferation of MAC-level programmable 
WiFi cards can potentially create serious coexis- 
tence problems, since some stations can implement 
greedy access policies for increasing their band- 
width share at the expenses of compliant users. For 
this reason, we proposed a game-theoretic analysis 
of persistent access schemes for WiFi infrastructure 
networks, in order to characterize equilibria con- 
ditions and to design disincentive mechanisms for 
inefficient behaviors. We proved that, when stations 
are interested in both uploading and downloading 
traffic, an homogeneous Nash Equilibrium arises, 
where all the stations reach the same non-null 
utility. Moreover, we also explored the utilization 
of the Access Point as an arbitrator for improving 
the global network performance. Specifically, we 
proposed two different solutions. When the required 
download traffic is different from zero, the AP 
can simply tune its channel access probability for 
controlling the station best response. When the 
downlink traffic is zero (or negligible), the AP 
can selectively discard the acknowledgments of too 
greedy stations. 

We proposed some extensions to standard DCF, 
in order to i) estimate the network status, and ii) 
emulate an access scheme based on best response 
strategies and AP mechanism design. We proved 
the effectiveness of our solutions for controlling 
the resource sharing in WiFi networks in various 
network scenarios. We are currently investigating 
on the prototyping of our solutions in actual WiFi 
cards and APs. While the estimate and best response 
modules can be simply implemented at the driver 
level, the ACK dropping scheme requires a hard- 
ware/firmware update. 
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Fig. 1. An example of DCF operation and equivalent slotted model. 



Fig. 3. Station utility in case of homogeneous access probability 
employed by all the stations and different k values. 
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Fig. 2. Utility of a given station i, 
function of the strategy r 4 (k — 1). 
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Fig. 4. Station utility at 1 1 Mbps in case of homogeneous access 
probability, n=10, k=l, and different tap values. 
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Fig. 5. Station utility at 600 Mbps in case of homogeneous access 
probability, n=10, k=l, and different tap values. 



Fig. 7. Station utility in case of ACK suppression, for n=10, k=oo 
and different a values. 
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Fig. 6. Station utility at NE as a function of the desired NE 
outcome, for n=10 and different k values. Maximum station utility 
under mechanism design (white boxes), station utility at NE under 
legacy AP (black boxes). 



Fig. 8. Aggregated throughput for various number of nodes. 
Comparison of our scheme (k — 1, k — 0.5) with standard DCF. 
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Fig. 9. Effects of best response strategies (k = 1) on AP aggregated 
throughput and a given node throughput in dynamic load conditions. 
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Fig. 1 1 . Effects of the ACK suppression scheme on a cheater station 
throughput. 




Fig. 10. Uplink and downlink throughput repartition at 11 Mbps 
under legacy AP (dashed lines) or mechanism design (bold lines), for 
n=10, k=0.5. 



Fig. 12. Unidirectional case (k = oo): aggregated throughput for 
various number of nodes. Best response is compared with standard 
DCF 



